Inter-Rater Reliability: Dependency on Trait Prevalence and Marginal Homogeneity

نویسنده

  • Kilem Gwet
چکیده

Researchers have criticized chance-corrected agreement statistics, particularly the Kappa statistic, as being very sensitive to raters’ classification probabilities (marginal probabilities) and to trait prevalence in the subject population. Consequently, several authors have suggested that marginal probabilities be tested for homogeneity and that any comparison between reliability studies be preceded by an assessment of trait prevalence among subjects. The objective of this paper is threefold: (i) to demonstrate that marginal homogeneity testing does not prevent the unpredictable results often obtained with some of the most popular agreement statistics, (ii) to present a simple and reliable inter-rater agreement statistic, and (iii) to gain further insight into the dependency of agreement statistics upon trait prevalence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A comparison of Cohen’s Kappa and Gwet’s AC1 when calculating inter-rater reliability coefficients: a study conducted with personality disorder samples

BACKGROUND Rater agreement is important in clinical research, and Cohen's Kappa is a widely used method for assessing inter-rater reliability; however, there are well documented statistical problems associated with the measure. In order to assess its utility, we evaluated it against Gwet's AC1 and compared the results. METHODS This study was carried out across 67 patients (56% males) aged 18 ...

متن کامل

Evaluation of Spasticity Using the Ashworth Scale with Intermediate Scores (ASIS)

Objectives: The main purpose of this research was to study and contribute to an accurate test of spastic limb. The intra, inter rater reliability of the test was examined. Methods: The present study was carried out in two parts In the first part of the study, the modified Ashworth Scale with Intermediate Scores (ASIS) was studied. During the second part of the study the intra, inter rater re...

متن کامل

Reliability of light microscopy and a computer-assisted replica measurement technique for evaluating the fit of dental copings

The aim of this in vitro study was to assess the reliability of two measurement systems for evaluating the marginal and internal fit of dental copings. Sixteen CAD/CAM titanium copings were produced for a prepared maxillary canine. To modify the CAD surface model using different parameters (data density; enlargement in different directions), varying fit was created. Five light-body silicone rep...

متن کامل

Test-Retest and Inter-Rater Reliability Study of the Schedule for Oral-Motor Assessment in Persian Children

Objectives: Reliable and valid clinical tools to screen, diagnose, and describe eating functions and dysphagia in children are highly warranted. Today most specialists are aware of the role of assessment scales in the treatment of affected individuals. However, the problem is that the clinical tools used might be nonstandard, and worldwide, there is no integrated assessment performed to assess ...

متن کامل

Inter-rater and intra-rater reliability in the interpretation of MTI Photoscreener photographs of Native American preschool children.

PURPOSE To evaluate inter- and intra-rater reliability for the interpretation of MTI Photoscreener photographs taken in a population of Native American preschool children with a high prevalence of astigmatism. METHODS Photographs of 369 children were rated by 11 nonexpert and 3 expert raters. Photographs for each child were scored as pass, refer, or retake. Nonexpert raters scored photos on t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002